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Abstract 

In this paper, we study combinatorial and structural properties of a new class of finite and 
infinite words that are 'rich' in palindromes in the utmost sense. A characteristic property 
of so-called rich words is that all complete returns to any palindromic factor are themselves 
palindromes. These words encompass the well-known episturmian words, originally introduced 
by the second author together with X. Droubay and G. Pirillo in 2001. Other examples of rich 
words have appeared in many different contexts. Here we present the first unified approach to 
the study of this intriguing family of words. 

Amongst our main results, we give an explicit description of the periodic rich infinite words 
and show that the recurrent balanced rich infinite words coincide with the balanced episturmian 
words. Wc also consider two wider classes of infinite words, namely weakly rich words and 
almost rich words (both strictly contain all rich words, but neither one is contained in the 
other). In particular, we classify all recurrent balanced weakly rich words. As a consequence, 
we show that any such word on at least three letters is necessarily episturmian; hence weakly 
rich words obey Fraenkel's conjecture. Likewise, we prove that a certain class of almost rich 
words obeys Fraenkel's conjecture by showing that the recurrent balanced ones are episturmian 
or contain at least two distinct letters with the same frequency. 

Lastly, we study the action of morphisms on (almost) rich words with particular interest 
in morphisms that preserve (almost) richness. Such morphisms belong to the class of P- 
morphisms that was introduced by A. Hof, O. Knill, and B. Simon in 1995. 
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1 Introduction 

In recent years there has been growing interest in palindromes in the field of combinatorics on 
words, especially since the work of A. de Luca [12] and also X. Droubay and G. Pirillo [II], who 
showed that the well-known Sturmian words are characterized by their palindromic complexity [TJ 
Ul[7]- A strong motivation for the study of palindromes, and in particular infinite words containing 
arbitrarily long palindromes, stems from their application to the modelling of quasicrystals in 
theoretical physics (see for instance |11[ 119]) and also Diophantine approximation (e.g., see [16]). 

In [13], the second author together with X. Droubay and G. Pirillo observed that any finite 
word w of length \w\ contains at most \w\ + 1 distinct palindromes (including the empty word). 
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Even further, they proved that a word w contains exactly \w\ + 1 distinct palindromes if and only 
if the longest palindromic suffix of any prefix p of w occurs exactly once in p (i.e., every prefix 
of w has Property Ju [13]). Such words are 'rich' in palindromes in the sense that they contain 
the maximum number of different palindromic factors. Accordingly, we say that a finite word 
w is rich if it contains exactly \w\ + 1 distinct palindromes (or equivalently, if every prefix of w 
has Property Ju). Naturally, an infinite word is rich if all of its factors are rich. In independent 
work, P. Ambroz, C. Frougny, Z. Masakova, E. Pelantova [2] have considered the same class of 
words which they call full words, following earlier work of S. Brlek, S. Hamel, M. Nivat, and 
C. Reutenauer in [7J. 

In [13], the second author together with X. Droubay and G. Pirillo showed that the family of 
episturmian words [131 121] . which includes the well-known Sturmian words, comprises a special 
class of rich infinite words. Specifically, they proved that if an infinite word w is episturmian, then 
any factor u of w contains exactly \u\ + 1 distinct palindromic factors. (See [51 118} 123] for recent 
surveys on the theory of Sturmian and episturmian words.) Another special class of rich words 
consists of S. Fischler's sequences with "abundant palindromic prefixes", which were introduced 
and studied in |15j in relation to Diophantine approximation (see also |16|). Other examples of 
rich words have appeared in many different contexts; they include the complementation- symmetric 
sequences [1], certain words associated with /3-expansions where (3 is a simple Parry number [2], 
and a class of words coding r-interval exchange transformations [4] . 

In this paper we present the first study of rich words as a whole. Firstly, in Section [21 we 
prove several fundamental properties of rich words; in particular, we show that rich words are 
characterized by the property that all complete returns to any palindromic factor are palindromes 
(Theorem 12. 14[) . We also give a more explicit description of periodic rich infinite words in Sec- 
tion [3] (Theorem 13. 1|) . 

In Section U] we define almost rich words: they are infinite words for which only a finite number 
of prefixes do not satisfy Property Ju. Such words can also be defined in terms of the defect of 
a finite word w, which is the difference between \w\ + 1 and the number of distinct palindromic 
factors of w (see the work of Brlek et al. in [7] where periodic infinite words with bounded defect 
are characterized). With this concept, rich words are those with defect and almost rich words are 
infinite words with bounded defect. 'Defective words' and related notions are studied in Section[4l 
where we also introduce the family of weakly rich words (which includes all rich words), defined 
as infinite words with the property that all complete returns to letters are palindromes. 

In Section [5] we consider applications to the balance property: an infinite word over a finite 
alphabet A is balanced if, for any two factors u, v of the same length, the number of x's in each 
of u and v differs by at most 1 for each letter x € A. (Sturmian words are exactly the aperiodic 
balanced infinite words on two letters.) First we describe the recurrent balanced rich infinite words: 
they are precisely the balanced episturmian words. We then go much further by classifying all 
recurrent balanced weakly rich words. As a corollary to our classification, we show that any such 
word on at least three letters is necessarily episturmian. Consequently, weakly rich words obey 
Fraenkel 's conjecture [T7J . We also prove that a certain class of almost rich words obeys Fraenkel's 
conjecture by showing that the recurrent balanced ones are episturmian or contain at least two 
distinct letters with the same frequency. 

Lastly, in Section [U we study the action of morphisms on (almost) rich words with particular 
interest in morphisms that preserve (almost) richness. Such morphisms belong to the class of P- 
morphisms that was introduced by A. Hof, O. Knill, and B. Simon in [19] (see also the nice survey 
on palindromic complexity by J. -P. Allouche, M. Baake, J. Cassaigne, and D. Damanik PQ). 
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1.1 Notation and terminology 

In what follows, A denotes a finite alphabet, i.e., a finite non-empty set of symbols called letters. 
A finite word over A is a finite sequence of letters from A. The empty word e is the empty 
sequence. Under the operation of concatenation, the set A* of all finite words over A is a free 
monoid with identity element e and set of generators A. The set of non-empty words over A is 
the free semigroup A + := A* \ {e}. 

A (right) infinite word x is a sequence indexed by N + with values in A, i.e., x = X1X2X3 ■ ■ ■ with 
each Xi G A. The set of all infinite words over A is denoted by «4 W , and we define A°° := A* \JA U . 
An ultimately periodic infinite word can be written as uv^ = uvvv ■ ■ ■ , for some u, v G A*, v 7^ e. 
If u = e, then such a word is periodic. An infinite word that is not ultimately periodic is said 
to be aperiodic. For easier reading, infinite words are hereafter typed in boldface to distinguish 
them from finite words. 

Given a finite word w = x\X2 ■ ■ ■ x m G A + with each Xj G A, the length of w, denoted by 
is equal to m and we denote by \w\ a the number of occurrences of a letter a in w. By convention, 
the empy word is the unique word of length 0. We denote by w the reversal of w, given by 
w = x m ■ ■ ■ X2X1. If w = w, then w is called a palindrome. The empty word e is assumed to be a 
palindrome. 

A finite word z is a factor of a finite or infinite word w G A°° if w = uzv for some u £ A*, 
v G A°°. In the special case u = e (resp. v = e), we call z a prefix (resp. suffix) of ui. The set 
of all factors of w is denoted by F(w) and the alphabet of w is Alph(u>) := F(w) D A We say 
that F(it;) is closed under reversal if for any it G F(w), u G F(w). When w = ps £ A + , we often 
use the notation p~ 1 w (resp. ws" 1 ) to indicate the removal of the prefix p (resp. suffix s) of the 
word w. 

The palindromic (right-) closure of a word u is the (unique) shortest palindrome having 
u as a prefix [12]. That is, = uv~ l u, where v is the longest palindromic suffix of u. 

Given an infinite word x = X1X2X3 • • • , the shift map T is defined by T(x) = (xj + i)j>i and its 
/c-th iteration is denoted by T k . For finite words w G A + , T acts circularly, i.e., if w = xv where 
x £ A, then T(w) = vx. The circular shifts T k (w) with 1 < k < \w\ — 1 are called conjugates of 
w. A finite word is primitive if it is different from all of its conjugates (equivalently, if it is not a 
power of a shorter word). 

A factor of an infinite word x is recurrent in x if it occurs infinitely often in x, and x itself is 
said to be recurrent if all of its factors are recurrent in it. Furthermore, x is uniformly recurrent 
if for all n there exists a number K(n) such that any factor of length at least K{n) contains all 
factors of length n in x (equivalently, if any factor of x occurs infinitely many times in x with 
bounded gaps [TO]). 

Let A, B be two finite alphabets. A morphism ip of A* into B* is a map tp : A* — > B* such 
that ip(uv) = f(u)f(v) for any words u, v over A. A morphism on A is a morphism from A* into 
itself. A morphism is entirely defined by the images of letters. All morphisms considered in this 
paper will be non-erasing, so that the image of any non-empty word is never empty. Hence the 
action of a morphism (p on A* naturally extends to infinite words; that is, if x = X1X2X3 • • • G A u , 
then <£>(x) = (p(xi)(p(x2)(f(x3) ■ ■ ■ . An infinite word x can therefore be a fixed point of a morphism 
ip, i.e., </j(x) = x. If ip is a (non-erasing) morphism such that (f(a) = aw for some letter a G A and 
w G A + , then cp is said to be prolongable on a. In this case, the word <p n (a) is a proper prefix of 
the word ip n+l {a) for each n G N, and the limit of the sequence (<p n (a)) n >o is the unique infinite 
word: 

w = lim (p n (a) = f^ia) (= awp(w)ip 2 (w)(p 3 (w) ■ ■ ■)■ 
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Clearly, w is a fixed point of <p and we say that w is generated by (p. 

A morphism ip on A is said to be primitive if there exists a positive integer k such that, for 
all x £ A, tp k (x) contains all of the letters of A. Any prolongable primitive morphism generates 
a uniformly recurrent infinite word |27j . 

For other basic notions and concepts in combinatorics on words, see for instance the Lothaire 
books [231 [24]. 

2 Definitions and basic properties 

In this section, we prove several fundamental properties of rich words. First we recall a number 
of facts already mentioned in the introduction. 

Proposition 2.1. |13l Prop. 2] A word w has at most \w\ + 1 distinct palindromic factors. 

Definition 2.2. A word w is rich if it has exactly \w\ + 1 distinct palindromic factors. 

Definition 2.3. A factor u of a word w is said to be unioccurrent in w if u has exactly one 
occurrence in w. 

Proposition 2.4. |13} Prop. 3] A word w is rich if and only if all of its prefixes (resp. suffixes) 
have a unioccurrent palindromic suffix (resp. prefix). 

Corollary 2.5. If w is rich, then: 

i) it has exactly one unioccurrent palindromic suffix (or ups for short); 

ii) all of its factors are rich; 

Hi) its reversal w is also rich. □ 

Note, i) is Property Ju from j!3j . 

Clearly, if w has a ups, u say, then u is the only ups of w, and moreover u is the longest 
palindromic suffix of w. So if w = vu, then w^' = vuv. Furthermore: 

Proposition 2.6. Palindromic closure preserves richness. 

Proof. Let w be rich with ups u. The case w = u is trivial. Now suppose w = fu for some 
(non-empty) word / = fx, x £ A. Then = fuf = f'xuxf'. Clearly xux is a ups of fux, 
so continuing we see that all prefixes of fuf have a ups of the form huh where h is a suffix of /. 
Thus u)( + ' = fuf is rich. □ 

Proposition 2.7. // w and w' are rich with the same set of palindromic factors, then they are 
abelianly equivalent, i.e., \w\ x = \w'\ x for all letters x € A. 

Proof. Any palindromic factor of w (resp. w') ending (and hence beginning) with a letter x 6 A 
is the ups of some prefix of w (resp. w'). Thus the number of x J s in w (resp. w') is the number of 
palindromic factors ending with x. □ 

Proposition 2.8. Suppose w is a rich word. Then there exist letters x, z E Alph(ui) such that 
wx and zw are rich. 
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Proof. If w is rich and not a palindrome, then let u be its ups with \u\ < \w\. Then u is preceded 
by some letter x in w, thus wx has xux as its ups, and hence wx is rich. If, on the contrary, w is 
a palindrome, let w = xv and let u be the ups of v. If u = v then v is a palindrome so w = xv 
gives v = x n for some n whence wx = x n+2 which is rich. If \u\ < \v\ then let y be the letter 
before u in v. Then yuy is ups of vy (because u is the ups of v). If yuy is not the ups of wy then 
it is a prefix of w, but then u is a prefix of v, and as \u\ < \v\, u occurs twice in v, contradiction. 
Thus yuy is the ups of wy and this one is rich. 

In view of Proposition 12.41 one can similarly show that zw is rich for some letter z £ Alph(u;). 

□ 

Note. In the case when w is rich and not a palindrome, the fact that wx is rich for some letter 
x 6 Alph(w) is a direct consequence of Proposition 12.61 

Naturally: 

Definition 2.9. An infinite word is rich if all of its factors are rich. 

Proposition 2.10. There exist recurrent rich infinite words that are not uniformly recurrent. 

Proof. Consider the infinite word t generated by the morphism (a i— > aba, b i— > bb) from [9]. 
It suffices to show (rather easily) that t is rich. Similarly, the Cantor word of [26], or even 
abab 2 abab' i abab 2 abab 4: abab' 2 abab' i abab 2 abab 5 ■ • • (fixed point of the morphism: a i— > abab, b i— ► b) 
are recurrent rich infinite words that are not uniformly recurrent. □ 

Proposition 2.11. A rich infinite word s is recurrent if and only if its set of factors F(s) is 
closed under reversal. 

The proof of this proposition uses the following lemma. Note that "richness" is not necessary 
for the "if" part. 

Lemma 2.12. A recurrent rich infinite word has infinitely many palindromic prefixes. 

Proof. Let v\ be a non-empty prefix of a recurrent rich infinite word s. Being rich, v\ has a 
unioccurrent palindromic prefix, u\ say (by Proposition 12. 4D . Let V2 be a prefix of s containing a 
second occurrence of u±. It has a unioccurrent palindromic prefix, U2 say. Now, U2 is not a prefix 
of u\ because u\ is not unioccurrent in V2, thus \u2\ > \u\\. □ 

Remark 2.13. Although the well-known Thue-Morse word m, which is the fixed point of the 
morphism [i : a i— ► ab, b i— ► ba beginning with a, contains arbitrarily long palindromes (see 
Example 14.101 later), m is not rich. For instance, the prefix abbabaabba is not rich (since its 
longest palindromic suffix abba is not unioccurrent in it). 

Proof of Proposition [2.111 IF: Consider some occurrence of a factor u in s and let v be a prefix 
of s containing u. As F(s) is closed under reversal, v £ F(s). Thus, if v is long enough, there is 
an occurrence of u strictly on the right of this particular occurrence of u in s. Similarly u occurs 
on the right of this u and thus u is recurrent in s. 

ONLY IF: As s is recurrent, it follows from Lemma [2.121 that F(s) is closed under reversal. □ 
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Theorem 2.14. For any finite or infinite word w, the following properties are equivalent: 

i) w is rich; 

ii) for any factor u of w, if u contains exactly two occurrences of a palindrome p as a prefix 
and as a suffix only, then u is itself a palindrome. 

Proof, i) =>• ii): Suppose, on the contrary, ii) does not hold for rich w. Then w contains a non- 
palindromic factor u having exactly two occurrences of a palindrome p as a prefix and as a suffix 
only. Moreover, these two occurrences of p in u cannot overlap. Otherwise u = pv~ l p for some 
word v such that p = vf = gv = vg = p; whence v = v and u = gvg = gvg, a palindrome. So 
u = pzp where z is a non-palindromic word. We easily see that u does not have a ups; thus u is 
not rich, a contradiction. 

ii) =>• i): Otherwise, let u be a factor of w of minimal length satisfying ii) and not rich. 
Trivially \u\ > 2, so let u = xvy with x,y £ A. Then xv is rich by the minimality of u. Since 
u is not rich whilst xv is rich, the longest palindromic suffix p of u occurs more than once in u. 
Hence, by ii) we reach a contradiction to the maximality of p. □ 

Remark 2.15. Given a finite or infinite word w and a factor u of w, we say that a factor r of w is 
a complete return to u in w if r contains exactly two occurrences of u, one as a prefix and one as 
a suffix (cf. 'first returns' in [20]). With this notion, Property ii) says that all complete returns 
to any palindromic factor are themselves palindromes. In particular, consecutive occurrences of 
a letter x in a rich word are separated by palindromes. 

Note. In view of Theorem 12.144 an alternative proof of the richness of episturmian words can 
be found in the paper [3] where the fourth author, together with V. Anne and I. Zorca, proved 
that for episturmian words, all complete returns to palindromes are palindromes. See also [22] 
for further work on 'return words' in Sturmian and episturmian words. 

3 Periodic rich infinite words 

Theorem l2.14l provides a characterization of rich infinite words by complete returns to palindromes. 
We now give a more explicit description of periodic rich infinite words. 

Theorem 3.1. For a finite word w, the following properties are equivalent: 

i) w u is rich; 

ii) w 2 is rich; 

Hi) w is a product of two palindromes and all of the conjugates of w (including itself) are rich. 

Example 3.2. (aabbaabab)^ and (abcba)^ are rich. 

The proof of Theorem 13.11 requires several lemmas. In what follows, x and z always denote 
letters. 

Lemma 3.3. If u is rich and ux has a palindromic suffix r such that 2\r\ > \u\, then ux is rich. 

Proof. We can suppose r has maximal length. If r has another occurrence in ux, then, as 2|r| + l > 
\ux\, the two occurrences overlap or are separated by at most one letter. Thus they both form a 
palindrome which is a suffix of ux and is strictly longer than r, a contradiction. Therefore r is 
the ups of ux, which is rich. □ 
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Lemma 3.4. If w = pq, p, q palindromes, then w has a conjugate w' = p'q' , p', q' palindromes 
with \\p'\ - \q'\ \ < 2. 

Proof. Easy. □ 

Lemma 3.5. If w = pq, p,q palindromes, is rich and 2\q\ > \w\ — 4 (resp. 2\p\ > \w\ — \), then 
pqp (resp. qpq) is rich. 

Proof. Suppose 2\q\ > \w\ — 4 (the other case is obtained by reversal as w is rich). If pqp is not 
rich, let vz, v £ ^4*, be the shorter prefix of p such that pqvz is not rich. Further, let r be the 
longest palindromic suffix of pqvz. Then, as zvqvz is a suffix of pqvz, we have |r| > |<7| + 2\v\ + 2; 
whence 2 \r | > 2|g| +4|u| + 4 > |w|+4|u| > \w\. Then by Lemm& \3.3\ pqvz is rich, contradiction. □ 

Lemma 3.6. If w = pq, p,q palindromes, and pqp, qpq are rich, then w 2 is rich. 

Proof. If w 2 = pqpq is not rich, let vz be the shorter prefix of q such that pqpvz is not rich. As 
qpq is rich, its prefix qpvz has a ups, r say. As zvpvz is a palindromic suffix of qpvz, r must begin 
in the prefix q of qpvz. As r is not the ups of pqpvz, consider its leftmost occurrence in pqpvz. If 
the two occurrences overlap or are separated by at most one letter, both they form a palindromic 
suffix of pqpvz. As this one is not the ups of pqpvz, it has another occurrence; whence r has 
another occurrence on the left of the leftmost one, contradiction. 

Thus, the two considered occurrences of r do not overlap. This implies that the leftmost 
occurrence of r lies in the prefix pq of pqpvz, but then by reversal r also occurs in qp, hence r is 
not the ups of qpvz, contradiction. □ 

Proof of Theorem \3.1\ i) =>■ ii): Trivial. 

ii) Hi): It suffices to show that w is a product of two palindromes. Let r be the ups of w 2 . 
Then, clearly \r\ > \w\, thus r = qw and w = pq for some p, q. Therefore r = qpq = qpq, whence 
p and q are palindromes. 

ii) =>• i): We show first that w s is rich. By Hi), w 2 has a ups qw and w = pq, p, q palindromes. 
For any u, v such that uv = p, consider / = w 2 u. Observe that / has a palindromic suffix 
uqpqu which is its ups, otherwise qpq would not be the ups of w 2 . Thus all such / are rich, in 
particular w 2 p is rich. Now, if ez is a prefix of q, we show by induction on \e\ that w 2 pez is 
rich. Let r be the longest palindromic suffix of w 2 pez. As this one has suffix zepqpez, we have 
\r\ > 2\p\ + \q\ + 2 1 e | + 2; whence 2\r\ > \w 2 pez\. Thus, by Lemma 13.31 w 2 pez is rich, and hence 
w 3 is rich. 

Now denote by s n the prefix of length n of s = w^. We show by induction on n that w 3 s n is 
rich. Let r be the ups of w s n . Then r is also a suffix of w s n . Clearly these two occurrences of r 
overlap, thus both give a palindromic suffix of w 3 s n . If this one were not the ups of w 3 s n , there 
would be another occurrence of r in w 2 s n , contradiction. Thus w s s n has a ups and, as w; 3 s n _i is 
rich, it is rich too. 

Hi) => i): By Lemma 13. 4| w has a (rich) conjugate w' = p'q' with p',q' palindromes and 
I — \q'\\ < 2, whence by Lemma 13 . 5 1 p'q'p' and q'p'q' are rich. Thus by Lemma 13.61 (w 1 ) 2 is rich. 
So, using part ll ii) i)" , (w') u is rich, and so too is w u . □ 

Remark 3.7. For Hi) the hypothesis that all of the conjugates of w are rich is not sufficient: abc 
is so, but (abc)^ is not rich. The hypothesis that w is rich and a product of two palindromes is 
not sufficient: w = ba 2 bab 2 aba 2 b is a rich palindrome, but T(w) = a 2 bab 2 aba 2 b 2 is not rich. 
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4 Some related words 



4.1 Defects & oddities 

The defect [7] of a finite word w is defined by 

D(te) = \w\ + l- |PAL(tu)|, 

where PAL(u;) denotes the set of distinct palindromic factors of w (including e). This definition 
naturally extends to infinite words w € A w by setting D(w) equal to the maximum defect of the 
factors of w. In fact, this definition may be refined by observing that if u is a factor of a word v, 
then D(u) < D(v) (see 0); thus 

D(w) = max{D(u) \ u is a prefix of w}. 

With this notion, finite or infinite rich words are exactly those with defect equal to (called 
full words in (2J [7]). Accordingly, we say that an infinite word with bounded defect is almost rich. 
Such infinite words contain only a finite number of prefixes that do not have a ups. 

Notation. Let t n denote the prefix of length n of a given finite or infinite word t. 

Proposition 4.1. If t n has a ups, then D(t n ) = D(t n -i), otherwise D(t n ) = Z)(t n _i) + 1. 

Proof. If t n has a ups, then t n contains one more palindromic factor than t n _i, whence D(t n ) = 
Z)(t n _i). On the other hand, if t n has no ups, then t n has the same number of palindromic factors 
as t n _i, thus D(t n ) = D(t n -i) + 1. □ 

In other words, if t has defect k, then there are exactly k "defective" positions; hence it is 
appropriate to say that such a word t has k defects. 

Remark 4.2. A noteworthy fact is that for a given word w with k defects, the extension wx, with 
x £ Alph(io), may not have the same number of defects (in particular, the palindromic closure of 
w may have greater defect). For example, w = caca 2 bca has 2 defects, but wx has 3 defects for 
x = a, b, or a 

Periodic almost rich words have the following simple characterization. 

Theorem 4.3. A periodic infinite word w u is almost rich if and only if w is a product of two 
palindromes. 

Proof. The "if" part follows immediately from [3 Theorem 6] : if p, q are palindromes and pq is 
a primitive word, then the defect of {pq) u is bounded by the defect of its prefix of length \pq\ + 
|Mzlillj . Conversely, if w u is almost rich, then, for large enough n, w n has a ups. Thus, as in 
the proof of ii) Hi) of Theorem 13.11 we get w = qp for some palindromes p, q. □ 

Proposition 4.4. // an almost rich word t is recurrent, then F(t) is closed under reversal. 

Proof. Let u be any prefix of t with the same defect number k as t. By recurrence, we can consider 
another occurrence of u such that t = su ■ ■ ■ for some non-empty word s. Then, any suffix s'u of 
su has a ups since every prefix v of t with \v\ > \u\ has a ups (otherwise the defect of t would be 
greater than k, by Proposition 14. ip . In particular, su has a ups, p say. Now, p is not a suffix of 
u because u is not unioccurrent in su, so \p\ > \u\ and we have p = vu = uv for some non-empty 
word u. Thus u and u are both factors of t, and hence F(t) is closed under reversal. □ 
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Definition 4.5. The pair {w, w} is an oddity of a finite or infinite word t if either w or w (or 
both!) is a non-palindromic complete return to some non-empty palindromic factor oft (called 
the incriminated palindrome ). 

Note. An oddity of a finite or infinite word t takes the form pup where p is the incriminated 
palindrome and u is a non-palindromic word. Indeed, non-palindromic complete returns to any 
palindrome p are necessarily longer than 2\p\ + 1 (see the proof of Theorem 12. 14|) . 

Let O(t) denote the number of oddities of t. 
Proposition 4.6. O(t) < D(t). 

Proof. Let w = pup be an oddity of t with p the incriminated palindrome. Let n be the minimal 
integer such that w or w occurs in t n (thus as a suffix). If t n has a ups, q say, then \q\ > \p\, 
trivial. If \q\ < \w\, then the prefix p of q occurs in the interior of the complete return w = pup, 
impossible. If \q\ > \w\ then w occurs as a prefix of q, in contradiction with minimality of n. 
Thus t n does not have a ups, i.e., n is a defective position. Thus each oddity gives a defect. For 
achieving the proof we have to show that n cannot be the same for two different oddities. Suppose 
t n has a second (suffix) oddity w' = qvq. If \p\ = \q\, clearly w' = w. Otherwise let |g| < \p\ for 
instance, then q occurs twice in p, thus w' is a suffix of p, hence w' is a prefix of p, contradicting 
the minimality of n. □ 

Example 4.7. We may have O(t) < D(t); for instance the periodic word (abcabcacbacb) u has 3 
oddities (abca, bcab, cabc) ending at positions 4, 5, 6, but 4 defects at positions 4, 5, 6, 7. The 
periodic infinite word {abc) U) has a defect at each position n > 4 but only three oddities (abca, 
bcab, cabc). So infinitely many defects do not necessarily give rise to infinitely many oddities. 

Proposition 4.8. A uniformly recurrent infinite word has infinitely many oddities if and only if 
it has infinitely many palindromic factors and infinitely many defects. 

Proof. ONLY IF: Suppose s is a uniformly recurrent infinite word with infinitely many oddities. 
Clearly, s has infinitely many defects as D(s) > 0(s) by Proposition 14.61 Moreover, s must have 
infinitely many palindromic factors. Otherwise, if s contains only a finite number of palindromes, 
then each of its palindromic factors has only a finite number of different return words (and hence 
only a finite number of non-palindromic complete returns) as s is uniformly recurrent. Hence s 
has only finitely many oddities, a contradiction. 

IF: Suppose, by way of contradiction, s is a (uniformly recurrent) infinite word with infinitely 
many palindromic factors and infinitely many defects, but only a finite number of oddities. Then 
there are only finitely many palindromic factors of s that are incriminated by the oddities and 
the longest of these palindromes has length L, say. Since s contains infinitely many palindromes, 
it has infinitely many 'non-defective' positions. Thus there exists an arbitrarily large n such that 
n is a defective position and such that the prefix s n+ i = s n x, with x S A, has a ups q, with 
\q\ > L + 2. Thus, with q = xq'x, we see that q' is a palindromic suffix of s n . Let r be the longest 
palindromic suffix of s n . As n is a defective position, r has another occurrence in s n and rur is a 
non-palindromic complete return to r, which is a contradiction since |r| > \q'\ > L. □ 

Remark 4.9. Uniform recurrence is necessary for the "only if" part of the above proposition 
(but useless for the "if" part). For example, with v\ = abed and v n = v n -\(abc) n d for n > 2, 
the (non-uniformly) recurrent infinite word V1V2V3 ■ ■ ■ has infinitely many oddities, but only five 
palindromic factors: e, a, b, c, d. 
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Example 4.10. The Thue-Morse word m has infinitely many oddities. Indeed, since m is gen- 
erated by the morphism /i 2 :o^> abba, b \—> baab, m clearly contains infinitely many palindromes. 
Moreover, one can prove by induction that m has infinitely many defects occurring in runs of 
length 2 2n+1 starting at positions 2 2n+3 + 1 and 2 2n+4 + 2 2n+3 + 1 for n > 0. 

4.2 Weakly rich words 

We say that an infinite word w over A is weakly rich (or simply a WR-word for short) if for every 
a € A, all complete returns to a in w are palindromes. This class of words contains all rich words 
but is in fact a much larger class. Clearly every binary word is weakly rich but not necessarily rich. 
The periodic infinite word (aacbccbcacbcf is readily verified to be weakly rich but not rich (since 
the complete return to aa is not a palindrome). Note, however, that the family of weakly rich 
words neither contains nor is contained in the family of almost rich words. Indeed, the WR-word 
{aacbccbcacbc)^ has infinite defect (since it does not take the form (pq)^ with p, q palindromes, 
and hence contains only finitely many distinct palindromic factors - see Theorem I4.3P . There also 
exist almost rich words that are not weakly rich; for instance, the almost rich word (aabacabaac)^ 
(which has only 2 defects at positions 10 and 11) is not weakly rich as cabaac is a non-palindromic 
complete return to c. 

Our motivation for introducing WR-words will become evident in the next section. 

5 Applications to balance 

A finite or infinite word is said to be balanced if, for any two of its factors u, v with \u\ = \v\, 
we have \\u\ x — \v\ x \ < 1 for any letter x, i.e., the number of x's in each of u and v differs by at 
most 1. Sturmian words are precisely the aperiodic balanced infinite words on a 2-letter alphabet. 

Fraenkel's conjecture [17] is a well-known problem related to balance that arose in a number- 
theoretic context and has remained unsolved for over thirty years. Fraenkel conjectured that, for 
a fixed k > 3, there is only one covering of 7L by k Beatty sequences of the form {\an + (3\) n >i, 
where a, (3 are real numbers. A combinatorial interpretation of this conjecture may be stated as 
follows (taken from [25]). Over a /c- letter alphabet with k > 3, there is only one recurrent balanced 
infinite word, up to letter permutation and shifts, that has mutually distinct letter frequencies. 
This supposedly unique infinite word is called Fraenkel's sequence and is given by (F^)^ where 
the Fraenkel words (i*i)i>i are defined recursively by Fi = 1 and = Fi_ilFj_i for all i > 2. For 
further details, see [25] and references therein. 

In [25] . Paquin and Vuillon characterized balanced episturmian words by classifying these 
words into three families. Amongst these classes, only one has mutually distinct letter frequencies 
and, up to letter permutation and shifts, corresponds to Fraenkel's sequence. That is: 

Proposition 5.1. [25] Suppose t is a balanced episturmian word with Alph(t) = {1,2, ... ,k}, 
k > 3. If t has mutually distinct letter frequencies, then up to letter permutation, t is a shift 

of(F k y. 

In this section, we first show that recurrent balanced rich infinite words are necessarily (bal- 
anced) episturmian words. Then, using a special map, we classify all recurrent balanced weakly 
rich words. As a corollary, we show that any such word (on at least three letters) is necessarily a 
(balanced) episturmian word. Thus, although WR-words constitute a larger class of words than 
episturmian words, the subset of those which are balanced coincides with those given by Paquin 
and Vuillon in [25] . Consequently, WR-words obey Fraenkel's conjecture. Using techniques similar 
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to those in the rich case, we also prove that a certain class of almost rich words (with only a few 
oddities) obeys Praenkel's conjecture by showing that the recurrent balanced ones are episturmian 
or contain at least two distinct letters with the same frequency. 

Before proceeding, let us recall some useful well-known facts about balance (see for instance 
the survey [30] and references therein). 

• In a balanced word, the gaps between successive occurrences of any letter x belong to a pair 
{k, k + 1} for some integer k > 0. 

• Any recurrent balanced infinite word with alphabet A and \A\ > 2 is periodic. 
5.1 Balanced rich words 

The main result of this section is the following theorem. 

Theorem 5.2. Recurrent balanced rich infinite words are precisely the balanced episturmian 
words. 

First we prove some lemmas. For a given letter a, we denote by ip a the morphism defined 
by ip a : a i— ► a, x i— > ax for all letters x ^ a. A noteworthy property of tp a is that tp a {w)a is a 
palindrome if and only if w is a palindrome. It is well-known that an infinite word is epistandard 
if and only if it is generated by an infinite composition of the morphisms i/j x . Moreover, an infinite 
word t is episturmian if and only if F(t) = F(s) for some epistandard word s (see [131 121j). 

Lemma 5.3. Suppose s = ip a {^) f or some letter a and infinite word t. 

i) If s is rich, then t is rich. 

ii) If s is balanced, then t is balanced. 

Proof, i): If false, let wx £ F(t) be minimal such that wx is not rich. If x ^ a then let r be the 
ups of ip a {wx) which is rich by hypothesis. Then ar = ip a (h) where h is a palindromic suffix (but 
not a ups) of wx. Thus h has another occurrence in wx, which implies r has another occurrence 
in ip a {wx), contradiction. 

If x = a we consider the ups of ip a {wa)a = ip a (w)aa and by a similar argument we reach a 
contradiction. 

ii) : If t is not balanced, then it contains two factors u, v of the same minimal length such that 
I Ma — \v\x\ = 2 for some x. Let U = ip a (u), V = ip a {v), then |Z7| = 2|tt| — \u\ a and \V\ = 2|i>| — |u| a . 
By adding to and/or deleting from U, V some a we get U' , V factors of s of the same length. If 
x ^ a then \\U'\ X — \V'\ X \ = 2. If x = a then, as \U\ a = \u\ = \v\ = \V\ a , we get \\U'\ a — \V'\ a \ = 2. 
In both cases, s is not balanced, contradiction. □ 

Remark 5.4. If s = ^a(t) or s = a~ 1 ip a (t) for some letter a and infinite word t, then the letter 
a is separating for s and its factors; that is, any factor of s of length 2 contains the letter a. 

Lemma 5.5. Suppose t is a recurrent infinite word with separating letter a and first letter x ^ a. 
Then t and at have the same set of factors. 

Proof. Clearly F(t) C F(at). To show that F(at) C F(t), let u be any factor of at. If u = a or u 
is not a prefix of at, then clearly u 6 F(t). Otherwise, if u ^ a is a prefix of at, then u takes the 
form axu' where xu' is a prefix of t. As t is recurrent, xu' occurs again in t, and hence u = axu' 
must be a factor of t because the letter a is separating for t. □ 
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This almost trivial lemma allows us to ignore the cases where the separating letter is not the 
first letter. 

Proof of Theorem \5.2l Let s be a rich, recurrent and balanced infinite word. If s has a separating 
letter, a say, then s = ip a (^) or s = a~ 1 tp a (t) for some recurrent infinite word t, which is also 
rich and balanced by Lemmas 15.31 and 15.51 If we can continue infinitely in this way then s is 
episturmian by the work in [21]. Otherwise we arrive at some recurrent infinite word, rich and 
balanced, without a separating letter; call it t. In particular, no xx occurs in t (because x would 
be separating). We call such an infinite word without factor xx, x £ A, a skeleton. Consider any 
factor of form xpx of t with p x-free. By Theorem 12.141 p is a palindrome, and as no square of a 
letter occurs in it, p has odd length. If xp\x and xp2X are two such factors of t, then in view of 
balance, \p±\ = \p2\- Thus x occurs in t with period ir x = \pi\ + 1. Take for x € A the letter with 
minimal ir x (i.e., the letter with the greatest frequency in t). If y is any other letter, as ir y > tt x , 
only one y may occur in a xpiX. By symmetry, this y lies at the centre of pi. Thus all pi are 
reduced to their centre, i.e., ir x = 2 and x is separating, contradiction. So this case is impossible 
and s is episturmian. □ 

As an immediate consequence of Proposition 15.11 and Theorem 15.21 we have: 

Corollary 5.6. Recurrent balanced rich infinite words with mutually distinct letter frequencies 
are Sturmian words or have the form given by Fraenkel's conjecture. □ 

5.2 Balanced weakly rich words 

We now establish a much stronger result, namely that WR-words obey Fraenkel's conjecture, by 
proving that balanced WR-words on at least three letters are necessarily (balanced) episturmian 
words. First we classify all such words. In order to state our classification, we need the following 
notation. 

Let x = X1X2X3 • • • € A u with each € .A, and let a be a new symbol not in A. We define 
(7 4 :i w ->(iU{a}) w by 

<j a (x) = axia ei X2a e2 x^a e ' s . . . 
where 6j € {1, 2}, with a = 2 if and only if Xi = Xj+i. 

Theorem 5.7. Suppose w is a recurrent balanced WR-word with Alph(w) = {1, 2, . . . , k}, k > 3. 

Then, up to letter permutation, w is either: 

1) a shift of the periodic word 

■tp™ o ip 2 o • • • o ip) c ^i(k UJ ) for some n > 1; 

2) or a shift of the periodic word 

o\ o CT2 o • • • o o~j o o if^j+2 o • • • o i/jk_x(k u ) for some 1 < j < k — 2. 

The proof of Theorem GT7] requires several lemmas. In what follows, we assume that Alph(w) = 
A with \A\ > 3. For each a € A, we set g a = sup \u\ where the supremum is taken over all factors 
u of w not containing the letter a. 

First we recall a useful lemma from |29j. 
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Lemma 5.8. \29\ Lemma 6] Suppose w G A u is balanced, and let a G A be such that the frequency 
of a in w is at least 1/3. Then the word w' G (^4\{a}) w obtained from w fry deleting all occurrences 
of the letter a in w is also balanced. 

Lemma 5.9. Suppose w G is a recurrent balanced WR-word, and let a £ A be such that 
9a < Qx f or M x £ A. Then the word w' G (A \ {a})^ obtained from w by deleting all occurrences 
of the letter a in w is also a recurrent balanced WR-word. 

Proof. Clearly w' is a recurrent WR-word; in fact, for each letter x ^ a, every complete return 
to x in w' is a complete return to x in w with all occurrences of a deleted. It remains to show 
that w' is balanced. Since w is balanced, it follows that if alia is a complete return to a in w, 
then each x £ A occurs at most once in U. Otherwise, if some letter x occurred more than once 
in U, we would have g x < g a . Moreover, since w is a WR-word, U must be a palindrome. Thus 
\U\ < 1, and hence g a = 1. It follows that the frequency of a in w is at least equal to 1/3, and 
hence from Lemma 15.81 we deduce that the word w' obtained from w by deleting all occurrences 
of a is balanced. □ 

Lemma 5.10. Let w and w' be as in Lemma \5.9\ Suppose w' contains the factor bb for some 
b £ A \ {a}. Then w is a shift of er (w'). In particular, the complete returns to b in w are of the 
form baab or baxab for some x G A \ {a, b}. 

Proof. Assume w' contains the word bb. Since w' is balanced (see Lemma I5.9D . every factor of w' 
of length 2 must contain at least one occurrence of b, and hence w' contains the factor bxb for 
every x G A \ {a, b}. Since g a = 1 (see Lemma l5Uj) . it follows that w contains factors of the form 
ba l b and ba k xa k b for every x G A \ {a, b} with both l,k > 1. It is readily verified from the balance 
property that if ba kl xa kl b and ba k2 ya k2 b are both factors of w with x, y G A\{a, b}, then k\ = k%. 
Again by the balance property it follows that \2k + 1 — l\ < 1 and k G {1,1 + 1,1 — 1}. If A; = /, 
then \l + 1| < 1 from which it follows that / = 0, a contradiction. If k = I + 1, then |Z + 3| < 1, 
again a contradiction. If k = I — 1, then \l — 1\ < 1 from which it follows that I = 2 and k = 1, 
for otherwise either I or k would equal 0. Thus w is obtained from w' by inserting one a between 
any pair of consecutive distinct letters in w' and two a's between consecutive 6's. In other words, 
w is a shift of <r a (w'), as required. □ 

Lemma 5.11. Suppose w and w' are as in Lemma \5.9\ and let b G A \ {a}. Then bbb is not a 
factor ofw'. 

Proof. Suppose w' contains bbb. Then by Lemma 15.101 w contains the factors baabaab and bacab 
for some c£^4\ {a, b}. But since w is balanced, w cannot contain both aabaa and bacab. □ 

Proof of Theorem \5.7\ We prove Theorem 15.71 by induction on the number of letters k. Suppose 
w is a recurrent balanced WR-word on the alphabet ^3 = {1,2,3}. Without loss of generality 
we can assume g\ < gi < 53. Let w' G {2,3}^ be the word obtained from w by deleting all 
occurrences of 1 in w. First suppose 22 does not occur in w'. In this case w' is a shift of the 
periodic word (23)^ = ^2(3"). So the only complete return to 2 in w' is 232. It follows that there 
exists an n > 1 such that the only complete return to 2 in w' is 21 n 31 n 2. Hence w = o ^2(3^). 
Next suppose 22 occurs in w'. It follows from the above lemmas that the complete returns to 3 in 
w' are of the form 323 and 3223. But if both factors occurred in w', then by Lemma [5.101 w would 
contain both 31213 and 31211213, which contradicts the fact that w is balanced. Thus w' is a 
shift of the periodic wor d (223)^ = ip% (3 U ), and hence by Lemma EM w is a shift of o~\ o tp^^). 
Thus, Theorem 15.71 holds for k = 3. 
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Next take k > 3 and suppose that w is a recurrent balanced WR-word on the alphabet 
Ak = {1,2,..., k}. By induction the hypothesis we assume Theorem 15.71 holds for any recurrent 
balanced WR-word on an alphabet of size k — 1. Without loss of generality we can assume that 
9i < 92 < • • • < 9k- Let w' be the word on the alphabet {2, 3, ... , k} obtained from w by deleting 
all occurrences of 1 in w. It follows from Lemma 15.91 that w' is a recurrent balanced WR-word, 
and hence by the induction hypothesis, w' is either a shift of ^2 ^3 ' ' ' V'fc-i(^) f° r some 
n > 1, or else a shift of 02 o (73 o • • • o o~j o ipj + i ipj+2 o ■ • • o ip^-iik^) for some 2 < j < k — 2. 

First suppose that 22 does not occur in w'. In this case w' must be a shift of ^2 ^3 ■ ■ ■ 
ipk-i^k^). Thus the complete returns to 2 in w' are all of the form 2x2 for some x € {3, 4, . . . , k}. 
Hence there exists an n > 1 such that each complete return to 2 in w is of the form 21 n xl n 2 
where x € {3, 4, . . . , k}. Thus in this case w is a shift of ip™ o ip? ■ ■ ■ ipk-iik^)- 

Next suppose w' contains the factor 22. Then by Lemma |5.11| w' is either a shift of o ip s o 
• • • o tpk-iik^), or a shift of 02 03 ■ • • 0j V^j+i o • • • o ■tp/ : -i(k UJ ) for some 2 < j < k — 2. 
It follows from Lemma 15.101 that w is either a shift of o\ o o ^3 o • • • o ip^-ii^)-, or else a shift 
of o"i o (i2 o • • • o <jj o tpj + i o ^'+2 o • • • o ipk-iik^) for some 2 < j < k — 2. Thus w is a shift of 
<7i o (7 2 o • • • o (7j o V'J+i ^'+2 o • • • o tpk-iik^) for some 1 < j < k — 2, as required. This concludes 
our proof of Theorem 15.71 □ 

Corollary 5.12. Suppose w be a recurrent balanced WR-word with Alph(w) = {1,2, ... ,k}, 
k > 3. Then w is a (balanced) periodic episturmian word. 

Proof. Recall that any infinite word generated by an infinite composition of the morphisms ipi is 
episturmian. Thus o ip2 ■ • • i'k-iik^) is a periodic episturmian word. It remains to show 
that the words described in case 2) of Theorem 15.71 are periodic episturmian words. To do this, 
we use the Fraenkel words (-Pi)j>i (defined previously). It is readily verified that if x = £1X2X3 . . . 
is an infinite word not containing the symbols {1, 2, . . . , n}, then 

cri o cr 2 o • • • o o-j o V>f+i(x) = Fj + iXiFf +1 x 2 Ff +1 x 3 . . . (5.1) 

and 

■01 o ^ 2 ° • ' ' ° ^j+l O 01 (x) = F J 2 + iXiF J 2 + iX2i ? / + iX3 . . . 

It follows that 

(71 O (7 2 O • ■ ■ O O-j O O 0j +2 O • • • O 1p k „i(k U ) = ipi O 1p 2 ' • • Ipj O V'j'+l ^1 ^j+2 O • • • O fc _i (fc W ) . 

Since the right hand side above is an infinite periodic episturmian word, it follows that the periodic 
infinite words listed in Theorem 15.71 are episturmian words. □ 

Hence, by Proposition 15.11 WR-words obey Fraenkel's conjecture; in fact, we can show this 
rather easily without the use of Proposition 15.11 

Corollary 5.13. Suppose w is a recurrent balanced WR-word with Alph(w) = {1,2, ... ,k}, 
k>3. 7/w has mutually distinct letter frequencies, then up to letter permutation, w is a shift of 

Proof. By Theorem 15. 7| w is isomorphic to a shift of one of the two types of periodic words 
listed in the statement of the theorem. We note that except for the extreme case of j = k — 2 
in case 2) of Theorem 15.71 the frequency of the symbols k and k — 1 are equal. Thus, under the 
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added hypothesis that distinct letters occurring in w have distinct frequencies, we deduce that w 
is isomorphic to a shift of 

(Jl O 02 O • • • O <7fc_ 2 ° Ipt-lik")- 



By ()5.ip . we have 

ai o a- 2 o o (Tfc „ 2 o VLi^) = (Fk-iW = Fk-i(Fjf) 
which is clearly a shift of the Fraenkel sequence (Fk) u . □ 



5.3 Balanced almost rich words 

We now extend our study to words having only a few oddities. In the spirit of Lemma 15.31 we 
first prove the following result (see also Theorem 16.261 to follow). 

Proposition 5.14. If s = ^ tt (t), then D(s) > D(t); in particular, if s is almost rich then t is 
almost rich. 

Example 5.15. The periodic infinite word t = {abcbac)^ has 1 defect and s = if; a (t) = (a 2 bacaba 2 c) 
has 2 defects. More generally, for any k > 1, t = {a k ba k ~ l ca k ~ l ba k c) UJ has k defects (see [7]), so 
applying ijj a to t gives a periodic infinite word with k + 1 defects. 

Proof of Proposition \5. lJ\ If s is rich, then t is rich (by Lemma [5.3H . and hence D(s) = D(t) = 0. 
So now suppose that t has at least one defect. Consider any prefix t m of t corresponding to a 
defect, i.e., t m does not have a ups. Let t m = t m -ix where x is a letter. We show that if x ^ a 
(resp. x = a), then ip a (^m) (resp. V'a(tm)a) has no ups and thus gives a defect in s. Let q be the 
longest palindromic suffix of t m which is not unioccurrent in t m since t m has no ups. 

Case x ^ a: p = a~ l ip a {q) is the longest palindromic suffix of ipai^m), which is not unioccurrent 
in it, otherwise q has another occurrence in t m , a contradiction. 

Case x = a: Similar to the above case, but with ifj a (t m )a and its longest palindromic suffix given 

by p = ip a (q)a. □ 

Note. Proposition 15.141 can be extended without difficulty to oddities; that is, if s = V'a(t), then 
O(s) > O(t). 

The main result of this section is the following: 

Theorem 5.16. Suppose s is a recurrent balanced infinite word with alphabet A, \A\ > 2, and 
less than \ A\ oddities. Then s is either episturmian or two of its letters have the same frequency. 

Proof. The proof relies on two lemmas, which are stated and proved below. As in the proof of 
Theorem 1 5. 21 we decompose s as much as possible using morphisms ip x , x £ A. If we can continue 
infinitely, then s is episturmian. Otherwise we halt at some skeleton t without a separating letter 
and with alphabet B. If \B\ < 3, t is Sturmian and hence has a separating letter, a contradiction. 
If \B\ > 2 then by Lemma 15.171 O(t) < \B\. Moreover, by Lemma 15.181 as t has no separating 
letter, it takes the following form (up to a shift): t = {x{ab) n ax{ba) n b) ul , for some n > 1. 

If the decomposition from s to t uses neither ip a nor tpt,, then a and b have same frequencies in 
s (as in t), as claimed. Otherwise we have for instance s = //i^ ^2(t) with tp a , ipf, not occurring 
in /j,2- Then considering factors xab and bob of t we have ^(xab) = fxgagb and ^(bab) = gbgagb 
for some {a, 6}-free words /, g; whence ip a (xgagb) = axhahab and ip a (bgagb) = abhahab where 
h = ip a {g), showing the unbalance axhaha, bhahab. Thus s in not balanced, a contradiction. □ 
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Lemma 5.17. Let m = ip c ( r ) and suppose m is balanced with alphabet A, \A\ > 2, and less than 
\A\ oddities. Then, if r has alphabet B = A \ {c}, r is a (balanced) skeleton with less than \B\ 
oddities. 

Proof. If w = X\x% • • ■ x n is an oddity in r then x\cx<i ■ ■ ■ cx n and cx\ ■ ■ ■ cx n c are oddities in m, 
thus 20(r) = O(m) < which implies 0(r) < \B\ if |.4| > 3. It is also clear that if r is not a 
skeleton, then it contains some aa, whence aca € -F(m). As |^4| > 2, there is another letter b in 
Alph(m), and hence cbc € F(m). Thus m is not balanced, a contradiction. □ 

Lemma 5.18. Let t be a recurrent balanced skeleton with alphabet B, \B\ > 2, and less than \B\ 
oddities and without any separating letter. Then, up to a shift, t takes the form (x(ab) n ax{ba) n b) u) 
for some n > 1. 

Proof. As O(t) < \B\ there is some letter, x say, such that all of the complete returns to x 
are palindromes (of the same odd length); call them xvx, xv'x, xv"x, . . . and write v = uzu, 
v' = u'z'u' and so on. We have \u\ > (otherwise x is separating in t). Consider a factor xvxv'x. 
Suppose firstly that u = u' . If u is not a palindrome, let u = ea ■ ■ ■ be, a ^ b. Then we have 
factors bezeb and aexea, contradicting the balance property. Thus u is an (odd) palindrome, say 
u = wyw. By the same argument, w is a palindrome and so on. Thus u has the form w n for some 
n, with Wi + \ = WiyiWi and W\, yi G B. 

If all u, u' , ... are equal, then the letter w± is separating in t, a contradiction. Thus we have 
for instance u ^ u' . Then u = ■ ■ ■ ae, u' = ■ ■ ■ be with e £ F(t) and a,b E B. The factors aezea 
and bez'eb give z = b, z' = a. Clearly, a and b do not occur in e, otherwise we have for instance 
afbfa and afafa being factors of t with / a- free. So, by the gap property for a, |/| > 2|/|; 
whence f = e. But then aa is a factor of t, a contradiction. Now observe that n, u' have the 
following property. 

Let u(i), u'(i) be the i-th letter of u, u' , respectively. If u{i) {a, b}, then u'{i) = u(i). On 
the other hand, if u{i) € {a,b}, then u'{i) £ {a, 6}. The proof is easy using and u'au' . Thus 
we can write 

u = /oco/ici • ■ • /„c n e, u' = /oc'o/ici • ■ ■ /^e, fo€B\ {a, b} , q, c- G {a, 6} . 

We easily see that fo = e and /j = f n+ \-i, using uxu' and ute. Now consider coexec' in uxn'. 
If Co = c' = a for instance, then as b does not occur in it, b has a gap greater than 2|e| + 2, a 
contradiction. Thus Co = a, c = b for instance. But now for the same reason we have the factor 
baexeba, i.e., fi = f n = £■ It follows that a and b have the same gaps: 2|e| + 1, 2|e| + 2; moreover 
a and 6 alternate in u, u' . Thus 

u = eabf2a ■ ■ ■ bae, u' = ebaf^b • ■ ■ abe (5-2) 

or 

u = ebaf2a ■ • • bae, u' = eabf^o ■ • • abe. (5-3) 

Consider the first case (the second one is similar). The factor baebeab in ubu shows that \ae\ 
is a gap for b, and hence |e| + 1 > 2|e| + 1. Thus e = e and we have 

u = abf^a ■ • -ba, u' = baf2b ■ ■ ■ ab. (5-4) 

Now 1 + \fi\ is a gap for a with 1 + \fi\ < 2; thus fi € B or fi = e . Moreover, considering fiafi + \ 
for instance, we have \fi\ + 1 + < 2; thus if fi is a letter, then and /j+i are empty. 
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Now consider xvxv'xv"x. If v! = u" then factors bxb and aba contradict the balance property. 
Thus v! ^ u" and easily u" = u, z" = z = b; whence, up to a shift, t = (xubuxu' au 1 )^ . 

Any letter y = fi € B \ {x, a, b) gives rise to two oddities, namely afiba and bfiab, and the 
left-most occurrence of y in u gives an oddity: ya ■ ■ ■ x ■ ■ ■ by for instance. Also abxa and baxb 
are oddities. Therefore O(t) > 3([B| - 3) + 2 = 3|B| - 7 and, as 0(t) < this gives [B| < 3. 
Thus all /i are empty and, for some k > 0, u = (ab) k a and v = (ab) 2k+l a. Similarly, the form 
of equation (|5.3p gives v = (ab) 2k a, k > 1. Hence, up to a shift, t = (x(ab) n ax(ba) n b) LO for some 
n > and letter x. □ 

Thus we get another class of infinite words, wider than "rich" , that obey Fraenkel's conjecture. 

6 Action of morphisms 

In this section, we study the action of morphisms on (almost) rich words, with particular interest 
in morphisms that "preserve" (almost) richness. We say that a morphism ip on A 'preserves 
(resp. strictly preserves) a property P of (finite or infinite) words if w & A 00 has property P =4> 
<p(w) has property P (resp. w £ .4°° has property P <3> tp(w) has property P). 

Note. For "richness" , finite or infinite words give the same definition for "preserves" (but not for 
"strictly preserves"). For "almost richness" the definition has meaning only for infinite words. 

6.1 Various results 

Part i) of Lemma 15.31 works in the opposite sense; thus we have: 
Proposition 6.1. Let s = ip a (t). Then s is rich if and only if t is rich. 

Proof. It suffices to show the "if part. If s is not rich, then let wx be the shortest prefix of t 
such that ip a {wx) is not rich. We show first that ip a (w)a is rich. Let p be the ups of in. Then 
ip a {w)a ends with the palindrome ip a (p)a. If this one has another occurrence in ip a (w)a then 
tp a (w)a = gip a (p)aha, h G A* whence w = g'ph', g = ijj a (g'), ah = ijj a (h'), thus h' 7^ e and p is 
not unioccurrent in w, a contradiction. 

Now if x = a the proof is over, otherwise it remains to show that ip a (wx) = tp a (w)ax has a 
ups. Suppose q is the ups of wx (which exists since wx is rich). Then q begins and ends with 
i/a, and hence a~ 1 if) a (q) is a palindromic suffix of tp a (wx). As previously, we easily see that it 
is unioccurrent in tp a (wx). □ 

Remark 6.2. The "if and only if" part of Proposition 16.11 does not extend to finite words. For 
instance, with v = abca, ipa(v) — aabaca is rich while v is not rich. 

Corollary 6.3. Episturmian morphisms strictly preserve richness of infinite words. 

Proof. Proposition 16.11 and Lemma 15.51 show that any elementary epistandard morphism ip a , as 
well as its conjugate ip a : a 1— ► a, x 1— > xa, strictly preserve richness of infinite words. Consequently, 
episturmian morphisms |13l [2~T] strictly preserve richness of infinite words as the monoid of all 
such morphisms is generated by all the ip a , ijj a , and permutations of the alphabet. □ 

Proposition 6.4. For a fixed letter a 6 A, the 'insertion' morphism (p a , defined by (p a : x 1— > xa 
for all x £ A, preserves richness. 
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Proof. Let p be the ups of a rich word u. If p 7^ u then ap a (p) is clearly a ups of tp a (u), but we 
also have to show that <p a {u)a~ 1 has a ups: this one is p a (p)a~ 1 . Now if p = u then (^ a (u)a _1 
is its own ups. Also let u = yt with y £ A and let q be the ups of y. If t is a palindrome, then 
u = a n for some n, a trivial case. Otherwise, let q be the ups of t. Then r = atp a {q) is the ups of 
a<p a (t) and it cannot be a prefix of tp a {u) because otherwise, as q is a prefix of u, we get q = a n 
for some n; whence easily we have a contradiction. □ 

The next proposition deals with a transformation which is not a morphism in general. Let w 
be a finite or infinite word. For any letter a G Alph(w;), if a fc x is a prefix of w (or xa k is a suffix) 
or ya k x occurs in w with x,y 7^ a, we say that A; is an exponent of a in w. Let fci < &2 < • • • be 
the sequence of the exponents of a in w; and let h± < /12 < • ■ ■ be another sequence of positive 
integers of the same length with hi < fcj for all i. Let it a {w) be the word obtained by replacing 
every exponent k{ by /ij in w. Then: 

Proposition 6.5. ir a (w) is rich if and only if w is rich. 

Proof. Suppose w is rich. Then by Theorem 12. 141 the complete returns to any palindromic factor 
of w are also palindromes. The same is true for Tr a (w) since ir a strictly preserves palindromes (i.e., 
a finite word u is a palindrome if and only if ir a {u) is a palindrome). Hence ir a {w) is rich (again 
by Theorem 12. 14f) . The converse is proved similarly. □ 

Proposition 6.6. If tp preserves richness and is prolongable on a G A, then <p u {a) is a rich 
infinite word. 

Proof. This is a trivial consequence of the fact that, for all n > 1, f n (a) is a rich word, since <p(a) 
is a rich word and ip preserves richness. □ 

Note. The converse does not hold. For example, the morphism 5 : a 1— ► a&a, 6 1— > bcb, c 1— » c6c 
generates rich infinite words, beginning with a, 6, and c as easily seen; however, 5 does not preserve 
richness (e.g., 5(acb) = abacbcbcb has a defect at the second occurrence of the letter b). 

Clearly, a morphism <p on A preserves palindromes if and only if <p(x) is a palindrome for all 
x e A. 

Proposition 6.7. Suppose (p is a morphism on A, with \A\ > 1. 7/ y strictly preserves palin- 
dromes, then tp is injective. 

Proof. Suppose <p strictly preserves palindromes and assume <p{u) = <p(v) for some non-empty 
words u, v E A*. Then, with p = uu and q = vv, <p{p) = tp{q) is a palindrome. Indeed, both <p(p) 
and <p(q) are palindromes since ip preserves palindromes, and moreover 

(p(p) = (p(u)ip(u) = <p(v)ip(u) = (p(v)(p(v) = (p(v)(p(v) = (p(q). 

Whence (p(pq) = (p(p) 2 is a palindrome and pq too (since ip strictly preserves palindromes). 
Therefore pq = qp, and hence p and q are powers of a common word (e.g., see Lothaire [23]), i.e., 
p = w m and q = w n . Therefore, since <p{p) = <p(q), we must have m = n; whence u = v. Thus <p 
is injective. □ 

Example 6.8. The non-injective morphism tp : a >— > aba, b \— > bcb, c 1— > aba preserves palindromes, 
but not strictly as tp(abc) = ababcbaba is a palindrome whereas the preimage abc is not. 
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The letter- doubling morphism ip^ defined by ipa : x i— » xx for all x E A strictly preserves 
palindromes; it also preserves almost richness. More precisely, we easily have: 

Proposition 6.9. If t has finite defect k, then (pd{t) has defect 2k. More precisely, if pi, ■ ■ ■ , 
Pk are the k defective positions in t, then the defective positions in <Pd{^) are 2pi — 1, 2pi for 
l<i<k. □ 

Example 6.10. The periodic infinite word t = (a 2 bacaba 2 c) UJ has only 2 defects at positions 10 
and 11, and p(t) = (a A b 2 a 2 c 2 a 2 b 2 a 4 c 2 ) UJ has 4 defects at positions 19,20,21,22. 

A simple example of a morphism that does not preserve almost richness is <p : a i— ► ac, b i— ► 
b,c*—>c. For instance, consider the (rich) Fibonacci word f , which is generated by the morphism: 
a i — > ab, b i— > a. We easily see that the image of f by </? has only six unique palindromic factors 
(e, a, 6, c, aca, cac), and hence ip(f) has infinite defect. 

6.2 Class P morphisms 

We now slightly extend the definition of "class P" morphisms introduced by Hof, Knill, and Simon 
P35] (see also p]). 

Definition 6.11 (Class P morphisms). 

i) A morphism ip on A is said to be a standard morphism of class P (or a standard P- 
morphismj if there exists a palindrome p (possibly empty) such that, for all x E A, <p(x) = 
pq x where the q x are palindromes. If p is non-empty, then some (or all) of the palindromes 
q x may be empty or may even take the form q x = 7T" 1 with ir x a proper palindromic suffix 
of P- 

ii) A morphism ip on A is said to be a morphism of class P (or a P- morphism ) if there exists 
a standard P -morphism ip, with <p(x) = pq x for all x G A, such that, for some factorization 
p = p'p" , we have ip{x) = p"q x p' for all x £ A. That is, ip = T l (ip) for some < i < \p\. 

Remark 6.12. Part ii) of Definition 16.111 tells us that any P-morphism is a conjugate of a 
standard one. Let us also observe that any P-morphism as defined in part ii) may also be a 
standard P-morphism, or a "dual" of a standard P-morphism (of the form x i— > q x p) for other 
p and q x , because for instance if \p'\ < \p"\, then p"q x p' = (p"p'~ 1 )(p'q x p l ) which has form rm x , 
where r, m x are palindromes. Indeed, the interest of part ii) is mainly in view of Definition 16.151 
hereafter. 

Note. The class of P-morphisms (resp. standard P-morphisms) is closed under composition, i.e., 
it is a monoid of morphisms. 

For our purposes, it suffices to consider standard P-morphisms in view of the following trivial 
property. 

Proposition 6.13. Suppose p is a standard P-morphism with (p(x) = pq x for all x G A and let 

ip = T l ((p) for some i, < i < \p\. Then, for any recurrent infinite word t, ip(t) and <p(t) have 
the same set of factors. □ 

Example 6.14. The morphism t : a h baa, b \—> baba is standard P (and its first conjugate 
T(r) : a i— > aab, b \— > abab is of class P). It generates a rich infinite word as does T(r). This 
follows easily from the fact that r = <p>\ o p> 2 with <p\ : a i— > a,b i— > ba and <p>2 '■ a l— ► ba,b i— > bb, 
where the latter two morphisms preserve richness: the first one is episturmian and the second one 
is an insertion morphism (see Corollary 16.31 and Proposition 16. 4|) . 
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Definition 6.15. We say that a standard P-morphism a is special if: 1) all cr{x) = pq x end 
with different letters, and 2) whenever a{x)p = pq x p, with x G A, occurs in some a(yiy2 • • • y n )p, 
then this occurrence is o~(y m )p for some m with 1 < m < n. A P-morphism is special if the 
corresponding standard P-morphism is special. 

Remark 6.16. When p = e, 2) means that the code o~(A) is comma-free (see [6]). Observe also 
that the elementary epistandard morphisms {ip x | x E .A} satisfy this definition. Moreover, as the 
monoid of epistandard morphisms is generated by all the ip x and permutations on A (see |13|. [21] ) , 
any such morphism is a special P-morphism. For example, ip a o tpj, is the special (standard) P- 
morphism with p = aba, q a = e, = a -1 . 

Theorem 6.17. Suppose a is a special standard P-morphism and let t = 2:1X2X3 • • • be a rich 
infinite word. Let h be minimal such that all palindromic factors oft of length at most 2 occur in 
the prefix t^. Then a(t) is rich if (and only if) a(th)p is rich. 

Proof. By induction, we suppose a(t n -i)p is rich for some n > h and show that o~(t n )p = 
a{t n -\)pq Xn p is rich. Let r be the ups of t n . Then R = a(r)p is ups of a(t n )p. Indeed, if 
R has another occurrence in a(t n )p, then by Definition 16.151 this occurrence is o~(xi ■ ■ ■ Xj)p with 
Xi- - -Xj = r and 1 < % < j < n. This implies that r has another occurrence in t n , a contradiction. 
We have also to show that for any factorization ef = q Xn p with e, / 7^ e, a(t n )pf~ l has a ups. 
With r 

— x n r'x n , o~(t n ^pf ^ has a palindromic suffix R — f ^Rf ^ — eR'e. Clearly r' ^ £, 
thus if R' has another occurrence in a(t n )pf~ l then it is ea(xi ■ ■ ■ Xj)pe. As e / e, we have 
Xi-i = Xj + \ = x n and • • • Xj+i = r, a contradiction. □ 

Corollary 6.18. Suppose a is a special standard P-morphism prolongable on a and let be the 
shortest prefix of s = er w (ct) that contains all palindromic factors of s of length at most 2. Then s 
is rich if (and only if) a{sk)p is rich. □ 

This can be extended to defective words. 

Theorem 6.19. Let a be a special standard P-morphism and t be an infinite word with finite 
defect k. Let h be minimal such that the prefix t/i has defect k and all palindromic factors oft of 
length at most 2 occur in t^. Then <r(t) is almost rich and its defect is equal to that of a(tfi)p. 

Proof. Clearly all prefixes t n of t with n > h have a ups of length at least 3. Thus, as in the 
proof of Theorem 16 .17} we find that all prefixes of <r(t) longer than o~(th)p have a ups. □ 

Remark 6.20. Naturally one might suspect that if a is a special P-morphism prolongable on a, 
then o^{a) is almost rich. This is not true, as the following proposition shows. 

Proposition 6.21. The special P-morphism a: a 1— > aba, b 1— > bcb, c 1— > cac generates s = 
ababcbaba ■ ■ ■ which has infinitely many defects. 

Proof. Let p n = a n (a) and let w n be the prefix of s of length (3 n + l)/2, i.e., w n = S( 3 n +1 )/ 2 - 
Then w n ends with some letter, x say, which is in the middle of p n . We show by induction that 
x is the one palindromic suffix of w n . Easily w n+ \x = a{w n ), thus w n+ \ ends with y such that 
xyx = o~(x). If w n +i has a palindromic suffix q other than y, then easily \q\ > 4. So it follows by 
2) of Definition 16.151 that q = yxcr(u)xy for some factor u of s. Hence o~(xux) is a palindromic 
suffix of w n +ix, and therefore xux is a palindromic suffix of w n , contradicting the induction 
hypothesis. □ 
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Indeed we have more generally: 

Proposition 6.22. Suppose a is a special standard P-morphism prolongable on a and let Sh be 

the shortest prefix of s = CT w {a) that contains all palindromic factors of s of length at most 2. 
Then s has infinite defect if and only if cr(sh)p is not rich. 

Proof. ONLY IF: If s has infinite defect, then a(sf l )p is not rich; otherwise, by Corollary 16.181 s 
would be rich, which is a contradiction. 

IF: Clearly s has at least one defect as o~(sh)p is not rich. To show that s has infinitely 
many defects, we suppose by way of contradiction that s has finite defect k > 1. Let s m be 
the shortest prefix of s that has defect k. By the minimality of ?72, has a ups for all n ^ m 
and s m = x\X2 ■ ■ ■ x rn does not have a ups. But the latter implies that a(s m )p does not have 
a ups. Indeed, if 

&(pm)p has a ups, R say, then R begins and ends with o~(x m )p. Moreover, 
as o~ is injective, R = o~(xi ■ • ■ x m )p for some i < m where palindromic suffix 

of s m . But then r must be unioccurrent in s m , otherwise R is not unioccurrent in a(s m )p, a 
contradiction. Therefore a(s m )p does not have a ups (i.e., s has a defect at position |<r(s m )p| > m), 
a contradiction. □ 

Example 6.23. Consider the special standard P-morphism ip : a i— ► aab 2 aa, b i— > bab. By Propo- 
sition 16.221 the infinite words ip^ (a) and <fP (b) have infinitely many defects since their respective 
prefixes (p(aabb) = aabbaaaabbaababbab and (p(babaabb) = babaabbaababaabbaaaabbaababbab are 
not rich (defects at the two penultimate positions in each case). However, if we consider for 
instance the (rich) Fibonacci word f , then 93(f) is a rich infinite word. To show this, we need only 
use Theorem 16.171 the shortest prefix of f containing all of its palindromic factors of length at 
most 2 is abaa and tp(abaa) = aabbaababaabbaaaabbaa is rich; whence 99(f) is rich. This provides 
a good example of a non-periodic rich infinite word that is different from a Sturmian word. It 
was inspired by the family of rich periodic words: (aab k aabab) w with k > 0, given in [7J. 

Remark 6.24. From Proposition 16.221 we see that special P-morphisms generate either rich infi- 
nite words or infinite words with infinitely many defective positions. Moreover, as any (primitive) 
special P-morphism generates a uniformly recurrent infinite word with infinitely many palindromic 
factors, those with infinite defect also have infinitely many oddities (by Proposition 14.8 j) . 

Example 6.25. Using Corollary 16.181 one can easily verify that the following special standard 
P-morphism generates a rich infinite word: a 1— > abb, b 1— > ac, c 1— ► a. 

There is a kind of converse for Theorems 16.171 and 16.191 (cf. Proposition 15.141) . 

Theorem 6.26. Suppose s = <p(t) where ip is a special standard P-morphism. Then D(s) > D(t); 
in particular, if s is rich, then t is rich. 

Proof. It suffices to show that if t = X1X2X3 ■ ■ ■ has a defect at position n, then s has a defect at 
position h = \tp(t n )p\. Otherwise, = <p(t n )p has a ups R beginning and ending with ip(x n )p. 
Thus, as 99 is a special P-morphism, R = <p(xi ■ ■ ■ x n )p for some i < n where r 
palindromic suffix of t n . Now, r must be unioccurrent in t n , otherwise R is not unioccurrent in 
S/j, a contradiction. □ 

Remark 6.27. Notice that property 1) in Definition 16 . 1 5 1 is too strong here; it suffices that 99 is 
injective, i.e., ip(A) is a code. 

From Theorems 16. 191 and 16.261 we immediately see that special P-morphisms strictly preserve 
almost richness. That is: 
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Theorem 6.28. Suppose s = cr(t) with a a special P-morphism. Then s is almost rich if and 
only if t is almost rich. □ 

Using the following easy lemmas, some of which are well-known, we end this section by proving 
a theorem which brings us one step closer to a characterization of morphisms preserving richness. 

Lemma 6.29. If p, q, p' , q' are non-empty palindromes and pq = p'q' is primitive, then p = 
p', q = q' ■ If pq is a primitive palindrome with p, q palindromes, then p or q is empty. 

Lemma 6.30. If pqr is a palindrome with p, q, r palindromes, then (pq) h = (rq) k , for some 
h,keN, (h,k) / (0,0). 

Lemma 6.31. If Xqp is a prefix of (pq) 1 ^ , pq primitive, p, q palindromes, then X = (pq) h p, for 
some h > 0. 

Lemma 6.32. The morphism 9 : a i— > a n , mi for all letter i/a strictly preserves richness. 
Theorem 6.33. Suppose if is a non-erasing morphism on A such that: 

• p(x) 7^ p{y) for all letters x ^ y; 

• p(x) is a primitive word for any letter x 6 A; 

• for any three distinct letters a, b, c, 

p{a) a p{bfp{cf =e, a,/3, 7 GZ a/3 7 = 0. (6.1) 

Then if p preserves richness, it is of class P. 

Proof. Let us denote the images of the letters by pi, 1 < i < \A\. We first show that p(a) = p\ 
and p(b) = p2 have the form given by the definition of class P. As a u is rich, pi^a)^ is rich, so 
by Theorem 13. ll yi = piqi with pi, q\ palindromes. Similarly if2 = P2<?2 with p2, qi palindromes. 
Now, as (a m ba m ) u ' is rich for any m, by the same argument as above, we have ^^^Px = PQ for 
some palindromes P, Q. We shall suppose first that both ipi and (f2 are not palindromes. There 
are three cases according to the place of the separation between P and Q. 

• Case P = <pfX, Q = Yip?, XY = ip 2 , X,Y G ^4*. If m is large, p x q x X is a suffix of P , 
thus Xqipi is a prefix of p™. By Lemma [6.311 X = Pi(qiPi) a - Similarly, Y = {qiPifqi- 
Thus <p2 = Pi(qiPi) a+/3 qi = p^ + ^ +l which is impossible. 

• Case PX = p™, Q = Xp2 l fT> X £ A*. Thus Xp2PX = Q; whence X is a palindrome and 
also P2P, i.e., P2qiP- By Lemma f6.30l and as £>2<?2 is primitive, we have Pq2 = {P2l2)^ +l \ 
and therefore P = (p2<?2) M P2- Consider two subcases. 

— Case \P\ > \p\q\\. As P is a (palindromic) prefix of ip™ ending with q x pi, it has the 
form (piqi) X pi, whence 

(pm) x pi = (P2Q2VP2, a, (i > (6.2) 

— Case \P\ < \piqi\. In this case, \X\ is large and, as it is a palindromic suffix of p™ , 
X = {qiPi) a qi- Thus, since PX = p™, we get P = (piqi) m ~ a ~ 1 pi, and hence P = p\. 
So we again get equation (16. 2|) with A = 0. 
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Now let p = (piqi) X pi = (p2«2) M P2- Then ipx = pxqx = pq a with q a = ((pi<?i) A 1 pi) 1 and 
•P2 = £>2<?2 = pqb with q b = {{p2q2Y~ l P2)~ 1 - 

• Case P = ipfip 2 X, XQ = ipf, X, Y 6 A*. By symmetry we get 

(qm) x qi = feP2)^2 = P, A, n > 0, (6.3) 

and (^i = <? a y>, V9 2 = 

Now suppose for instance that ip-y is a palindrome (but not 922)- Then it is easily seen that 
equation (|6.2p (resp. equation (|6.3p ) also holds with pi = ipi, q\ = e (resp. pi = e, gi = 921). Let 
us also observe that the pair (A, /i) in equation (j6. 2f) or (|6.3p is unique. Indeed if A' > A and 
p! > // also work, we get 9? A ~ A = 9?2 _At ; whence easily </?i = ip2, a contradiction. 

Thus the 'shape' of class P is satisfied for letters a, b, a ^ b. It remains to pass to A in 
totality. Suppose first, with notations as before, that A contains at least two different letters, a, 
b with (px 7^ ip2 both non-palindromes. Let c be any other letter and 92(c) = 923 = p^qz- Consider 
the three pairs of letters. 

• First case: 

Using (a, b): ipx = pq a , <P>2 = pqb] 
Using (a,c): y x = rs a , 923 = rs c ; 
Using (b,c): ip 2 = tu h , 923 = tu c . 

Here, p, q, r are given by equation (|6,2p and similar ones. Suppose for instance \p\ > \r\ > \t\. 
Then we get p = <£>"r, r = ip^t p = ip\t for some a, (3, 7. Thus, 9?" 923I = tp%t. This gives 



a/?7 = by condition (|6.ip ; whence p = r or r = i or p = i. The case r = t for instance 
gives (fx = rs a , 922 = ™&> 923 = rs c . But, by the observation above, r = p. 

• Second case: the same for (a, b) and (a, c), but (6, c) gives 922 = u^t, 923 = u c t and t is given 
by an equation of form (|6,3p . We deduce p = ip± r, pt = tp^, rt = 923 for some £, 77, r; whence 
ifl = tPx<P3- Clearly, pt, rt / e. Thus by (|6.ip . £ = 0, r = p. 

In conclusion, 92 is a P-morphism. 

Now suppose A contains exactly one letter, a, whose image ipx is not a palindrome and consider 
any other two letters b, c. 

• First case: 

Using (a, b): ipx = pq a , 9?2 = Mb\ 
Using (a,c): cpx = rs a , 923 = rs c . 

Here, p, r, are given by equation (16. 2p and a similar one. Suppose for instance \p\ > |r|. We 

have p = (pigi) A pi = y?2 anc ^ r = (Pi9i) A Vi = ¥2 '■> whence ip\ ~ x = ^2 _/J '■ As ipx ^ P2 are 
primitive, A' — A = 0, r = p. 

• Second case: the same for (a, b), but (a, c) gives 991 = s a r, 993 = s c r with r = (qxPi) 6 qi = P>3, 
and hence pr = /p x+6+1 = 922923- AsA + 6> + l>0 this gives \iv = by (16. ip . which is 
impossible. 

Lastly, if all images of letters are palindromes, then ip is trivially of class P. □ 
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Remark 6.34. Condition (|6.ip of Theorem 16.331 is satisfied if (p is injective, or if it strictly 
preserves richness (using the property that a x b y c z a x is not rich). The theorem could be extended 
to non-primitive ip(x) using Lemma 16.321 but conditions should be formulated accordingly. 

Now let us recall from Theorem 14.31 that periodic almost rich words are of the form u u where 
u is a product of two palindromes and that only this property is used in proof of Theorem 16.331 
Thus we also have: 

Theorem 6.35. Suppose if is a morphism satisfying the conditions of Theorem 1 6. 331 Then if ip 
preserves almost richness, it is of class P. □ 

Furthermore, it is not too difficult to see that 'preserves almost richness' could be replaced 
by 'preserves infiniteness of palindromic factors'. This is related to the following long-standing 
open question posed by Hof, Knill, and Simon in [19]: are there (uniformly recurrent) infinite 
words containing arbitrarily long palindromes that arise from primitive morphisms, none of which 
belongs to class PI The answer is believed to be no. Up to now, it has only been shown to hold 
in the periodic case (see [T]) and also in the 2-letter case (see [28]). 

7 Concluding remarks 

To end, we mention a particularly relevant result that gives a good estimate of the palindromic 
complexity of uniformly recurrent infinite words in terms of the factor complexity. Let us first 
recall that the palindromic complexity function V(n) (resp. factor complexity function C(n)) of a 
given infinite word counts the number of different palindromic factors (resp. number of different 
factors) of length n for each n > 0. In [4j, Balazi et al. proved that for uniformly recurrent infinite 
words with factors closed under reversal, 

V{n) + V(n + 1) < C(n + 1) - C(n) + 2 for all n € N. (7.1) 

Infinite words for which V(n) +V(n+1) always reaches the upper bound given in relation (17. ip 
can be viewed as words containing the maximum number of palindromes. Naturally one would 
conjecture that all such words are rich. Indeed, this assertion is true - it was recently proved by 
the first and fourth authors together with M. Bucci and A. De Luca in [8]. Interestingly, its proof 
relies upon another new characterization of rich words, which is useful for establishing the key 
part of the proof, namely that the so-called super reduced Rauzy graph is a tree. 
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